Abstract

We prove that the quantum relative entropy is a rate function in the large deviation
principle. Next, we define information criteria for quantum states and estimate the
accuracy of their use. Most of the results in this paper are essentially based
on the Hiai-Ohya-Tsukada theorem.

1 Introduction

In recent years there has been a high level of interest in quantum information theory among
researchers. In particular, they are concentrating their studies on the control and estimation
of quantum states in many systems in order to put them to practical use.

Quantum estimation theory is one of the areas of quantum information theory, which
was established in the 1970s [HI [19]. Statistical decision theory and hypothesis testing for
quantum systems were mainly studied in the beginning stage of this theory. Recently
various methods such as information geometry, asymptotic theory, and Bayesian analysis
have been studied theoretically and applied to many experiments. In particular, it is not too
much to say that quantum estimation theory has had a monopoly on the optimization of
measurement.

Our main interest in this paper is asymptotic theory based on the large deviation principle.
This is the reason why we use the relative entropy, which plays the role of a rate function in
the large deviation principle and is a typical quasi-distance between probability distributions.
The quantum relative entropy is defined analogously and becomes a quasi-distance between
quantum states. In early investigations [35], [37], [36], the possibility that it is a rate function
in the large deviation principle was studied in the context of the variational principle. No one,
however, could answer whether this conjecture is correct in a general setting.

A partial answer is given by a quantum version of Stein's lemma in [T8 l [25 1 12^, which
is conceptually different from the original Stein's lemma and from that in this paper.
By contrast, a widely accepted quantum version of Sanov's theorem has not yet
appeared [15]. There is only one proposal, given in [6].

We will show that some classical statistical methods are applicable to quantum
systems, owing to the universality of statistics and information theory. As a part of this
attempt, we will answer the previous question of whether the quantum relative entropy is
a rate function. Concretely, a quantum version of Stein's lemma and of Sanov's
theorem are proved in Section 4.

Next, in Section 5, quantum model selection will be discussed. We try to apply
information criteria to quantum states. Information criteria for probability distributions
are usually described in terms of the relative entropy, whereas we derive those for quantum
states from the quantum relative entropy. In preceding investigations [ISIHT], information
criteria are applied to probability distributions calculated from POVMs (positive
operator-valued measures). Accordingly, the accuracy of the estimation by the use of information
criteria for quantum states has not been discussed.

The mathematical and physical backgrounds of this paper are algebraic quantum theory,
measuring processes, and the Hiai-Ohya-Tsukada theorem. We begin with algebraic quantum
theory before stating the main results. Its mathematical basis is the theory of operator
algebras, which clarifies the structures of C*-algebras and of von Neumann algebras, the spaces
of linear functionals on them, and so on [TJ EHl EH]. C*-algebras, which describe the
observables of the system, are mainly used in the present paper, and equivalence classes of
their *-representations, called sectors, play the essential role of classifiers of macroscopic
structures in quantum systems. We describe measuring processes within the framework
of algebraic quantum theory [131 l2H]. We use the (C*-)tensor product of the algebra of
the system and that of an apparatus as the algebra of observables of the composite system.
Measuring interactions are then defined as automorphisms on this tensor algebra. After
these preparations, we can apply the Hiai-Ohya-Tsukada theorem [17, Theorem 3.2]. This
theorem is crucial for relating the quantum relative entropy to the relative entropy of
probability distributions of measured data. We would like to mention that the methods
developed in the paper arise from the context of Large Deviation Strategy [22].

Finally, in Section 6, we will discuss the quantum $\alpha$-divergence [3] as a quasi-entropy
formulated by D. Petz [31]. See [3] for its fundamental properties. We will show that the
quantum $\alpha$-divergence is applicable in the (C*-)algebraic setting. First, for this purpose,
the $\alpha$-analogue of the Hiai-Ohya-Tsukada theorem is proved. Secondly, a quantum version of
the Chernoff bound is proved, which is different from that in [5] |231 El EI]. This is a large
deviation type estimate, too. Lastly, we define a Bayesian $\alpha$-predictive state and prove
that it minimizes a risk function constructed from the quantum $\alpha$-divergence. This result
generalizes those in [H HH HQ].

2 Algebraic Quantum Theory and Measuring Processes

2.1 Algebraic Quantum Theory and Preliminaries 

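Algebraic quantum theory begins with a C*-algebra $\mathfrak{A}$ of observables of the system. A
C*-algebra $\mathfrak{A}$ is a *-algebra over $\mathbb{C}$, i.e., an algebra over $\mathbb{C}$ with the
involution $* : \mathfrak{A} \to \mathfrak{A}$ defined by

$$(aA + bB)^* = \bar{a}A^* + \bar{b}B^*, \quad (AB)^* = B^*A^*, \quad A^{**} := (A^*)^* = A,$$

for $A, B \in \mathfrak{A}$, $a, b \in \mathbb{C}$, where $\bar{a}$ is the complex conjugate of $a$, and which is also
a Banach space with the norm $\|\cdot\| : \mathfrak{A} \to \mathbb{R}_{\geq 0} = \{a \in \mathbb{R} \mid a \geq 0\}$ satisfying, for
$A, B \in \mathfrak{A}$,

$$\|AB\| \leq \|A\|\,\|B\|, \quad \|A^*A\| = \|A\|^2, \quad \|A^*\| = \|A\|.$$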
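As a concrete illustration of the involution axioms, the following minimal sketch checks them numerically on $2 \times 2$ complex matrices with the conjugate transpose as involution. The choice of matrices as a toy *-algebra and the helper names (`adjoint`, `matmul`, `lincomb`, `close`) are assumptions of this example, not part of the paper.

```python
# Toy *-algebra: 2x2 complex matrices with the conjugate transpose as involution.
# All helper names are hypothetical and used for illustration only.

def adjoint(m):
    """Conjugate transpose of a square matrix given as nested lists."""
    n = len(m)
    return [[m[j][i].conjugate() for j in range(n)] for i in range(n)]

def matmul(x, y):
    """Matrix product of two square matrices of the same size."""
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def lincomb(a, x, b, y):
    """Entrywise linear combination a*x + b*y."""
    n = len(x)
    return [[a * x[i][j] + b * y[i][j] for j in range(n)] for i in range(n)]

def close(x, y, tol=1e-12):
    """True if two matrices agree entrywise up to tol."""
    n = len(x)
    return all(abs(x[i][j] - y[i][j]) < tol for i in range(n) for j in range(n))

A = [[1 + 2j, 0.5j], [3, -1j]]
B = [[2, 1 - 1j], [0.25j, 4]]
a, b = 2 - 1j, 0.5 + 3j

# (aA + bB)* = conj(a) A* + conj(b) B*
antilinear = close(adjoint(lincomb(a, A, b, B)),
                   lincomb(a.conjugate(), adjoint(A), b.conjugate(), adjoint(B)))
# (AB)* = B* A*
antimultiplicative = close(adjoint(matmul(A, B)), matmul(adjoint(B), adjoint(A)))
# A** = A
involutive = close(adjoint(adjoint(A)), A)
```

All three flags should evaluate to `True`; the norm conditions are not checked here because the operator norm would require a singular value computation.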
In particular, self-adjoint elements $A = A^*$ of $\mathfrak{A}$ are called observables. C*-algebras $\mathfrak{A}$ are
assumed to be unital in this paper, i.e., $\mathfrak{A}$ has the unit $1$. Next, a state $\omega$ on $\mathfrak{A}$ is defined
as a normalized positive linear functional:

$$\omega(1) = 1, \quad \omega(A^*A) \geq 0, \quad \omega(aA + bB) = a\,\omega(A) + b\,\omega(B),$$

for $A, B \in \mathfrak{A}$, $a, b \in \mathbb{C}$. We denote by $E_{\mathfrak{A}}$ the set of states on $\mathfrak{A}$. As is seen from the
definition, the concept of a state is nothing but a complex-valued output mapping. The
following theorem then holds:

Theorem 1 (GNS construction theorem [3 [38]). Let $\mathfrak{A}$ be a C*-algebra and $\omega$ a state on
$\mathfrak{A}$. Then there exist a Hilbert space $\mathcal{H}_\omega$ with the inner product $\langle\cdot,\cdot\rangle$, a vector $\Omega_\omega \in \mathcal{H}_\omega$,
and a *-homomorphism $\pi_\omega : \mathfrak{A} \to B(\mathcal{H}_\omega)$, called a *-representation (a representation,
for short) of $\mathfrak{A}$, such that $\omega(A) = \langle\Omega_\omega, \pi_\omega(A)\Omega_\omega\rangle$. The triplet $(\pi_\omega, \mathcal{H}_\omega, \Omega_\omega)$ is called a
GNS representation of $\mathfrak{A}$ with respect to $\omega$.

Furthermore, two GNS representations $(\pi_{\omega,1}, \mathcal{H}_{\omega,1}, \Omega_{\omega,1})$, $(\pi_{\omega,2}, \mathcal{H}_{\omega,2}, \Omega_{\omega,2})$ of $\mathfrak{A}$ with
respect to $\omega$ are unitarily equivalent, i.e., there exists a unitary operator $U : \mathcal{H}_{\omega,1} \to \mathcal{H}_{\omega,2}$
such that $\pi_{\omega,2}(A) = U\pi_{\omega,1}(A)U^*$ for $A \in \mathfrak{A}$ and $\Omega_{\omega,2} = U\Omega_{\omega,1}$.

We denote by $\mathfrak{A}^*_+$ the set of positive linear functionals on $\mathfrak{A}$. The GNS construction
theorem holds for each $\phi \in \mathfrak{A}^*_+$. Let $\mathcal{H}$ be a Hilbert space. For each subset $\mathcal{M}$ of
$B(\mathcal{H})$, let $\mathcal{M}'$ denote the set of all elements in $B(\mathcal{H})$ commuting with every element
in $\mathcal{M}$. A subalgebra $\mathcal{M}$ of $B(\mathcal{H})$ is called a *-subalgebra if $\mathcal{M}$ is invariant under the
involution. Then a von Neumann algebra on $\mathcal{H}$ is a *-subalgebra $\mathcal{M}$ of $B(\mathcal{H})$ such that
$\mathcal{M} = \mathcal{M}'' := (\mathcal{M}')'$, and a factor is a von Neumann algebra $\mathcal{M}$ with trivial center
$\mathfrak{Z}(\mathcal{M}) := \mathcal{M} \cap \mathcal{M}' = \mathbb{C}1$. The center $\mathfrak{Z}(\mathcal{M})$ of the von Neumann algebra is a unique
maximal abelian *-subalgebra of $\mathcal{M}$ commuting with every element of $\mathcal{M}$. It is well known
that, for a subset $\mathcal{F}$ of $B(\mathcal{H})$ which is invariant under the involution, $\mathcal{F}''$ is the smallest
von Neumann algebra containing $\mathcal{F}$. It is called the von Neumann algebra generated by
$\mathcal{F}$. A typical example is given by the von Neumann algebra generated by a GNS
representation $(\pi_\omega, \mathcal{H}_\omega, \Omega_\omega)$ of $\mathfrak{A}$ with respect to $\omega$. Since the subset
$\pi_\omega(\mathfrak{A}) = \{\pi_\omega(A) \mid A \in \mathfrak{A}\}$ of $B(\mathcal{H}_\omega)$ is invariant under the involution, $\pi_\omega(\mathfrak{A})''$ is the
smallest von Neumann algebra on $\mathcal{H}_\omega$ generated by $\pi_\omega(\mathfrak{A})$.

Let $\pi$ be a representation of $\mathfrak{A}$. A state $\omega$ on $\mathfrak{A}$ is said to be $\pi$-normal if there exists a
normal state$^1$ $\rho$ on $\pi(\mathfrak{A})''$ such that $\omega(A) = \rho(\pi(A))$ for every $A \in \mathfrak{A}$. Two representations
$\pi_1, \pi_2$ are quasi-equivalent, denoted by $\pi_1 \approx \pi_2$, if each $\pi_1$-normal state is $\pi_2$-normal and
vice versa. As a complement, two representations $\pi_1, \pi_2$ are disjoint, denoted by
$\pi_1 \overset{\circ}{|} \pi_2$, if no $\pi_1$-normal state is $\pi_2$-normal and vice versa. A state $\omega$ on a C*-algebra $\mathfrak{A}$
is called a factor state if the center $\mathfrak{Z}_\omega(\mathfrak{A}) := \pi_\omega(\mathfrak{A})'' \cap \pi_\omega(\mathfrak{A})'$ of the corresponding von
Neumann algebra $\pi_\omega(\mathfrak{A})''$ is trivial. We denote by $F_{\mathfrak{A}}$ the set of factor states on $\mathfrak{A}$.

Definition 1 ([26]). A sector of a C*-algebra $\mathfrak{A}$ is defined as a quasi-equivalence class of
factor states of $\mathfrak{A}$.

A key concept of this paper is the sector defined above. Before mentioning how to use
it, we introduce the central measure $\mu_\omega$ of a state $\omega$ on $\mathfrak{A}$. For a positive linear functional
$\omega$ on $\mathfrak{A}$, a positive regular Borel measure $\mu$ on $E_{\mathfrak{A}}$ is called a barycentric measure of $\omega$,
which is called the barycenter of $\mu$ and denoted also by $b(\mu)$, if it satisfies

$$\omega = \int_{E_{\mathfrak{A}}} \rho \, d\mu(\rho).$$

$^1$ A positive linear functional $\rho$ on a von Neumann algebra $\mathcal{M}$ on $\mathcal{H}$ is said to be normal if there
exists a positive trace class operator $\sigma \in T(\mathcal{H})_+$ such that $\rho(A) = \mathrm{Tr}[\sigma A]$ for each $A \in \mathcal{M}$.
It is proved in [7, Theorem 4.1.25] that there is a one-to-one correspondence between
the abelian von Neumann subalgebras $\mathfrak{B}$ of $\pi_\omega(\mathfrak{A})'$ and the orthogonal measures$^2$
$\mu =: \mu_{\mathfrak{B}}$ of $\omega$. In the case that $\mathfrak{B}$ is a subalgebra of the center $\mathfrak{Z}_\omega(\mathfrak{A})$, the corresponding
orthogonal measure $\mu_{\mathfrak{B}}$ is called a subcentral measure of $\omega$ and satisfies the following
property: for each $\Delta \in \mathcal{B}(E_{\mathfrak{A}})$, GNS representations $\pi_{\omega_\Delta}$ and $\pi_{\omega_{E_{\mathfrak{A}} \setminus \Delta}}$ of

$$\omega_\Delta = \int_\Delta \rho \, d\mu_{\mathfrak{B}}(\rho) \quad \text{and} \quad \omega_{E_{\mathfrak{A}} \setminus \Delta} = \int_{E_{\mathfrak{A}} \setminus \Delta} \rho \, d\mu_{\mathfrak{B}}(\rho),$$

respectively, are unitarily equivalent to subrepresentations of $\pi_\omega$, and are disjoint from each
other. In particular, $\mu_\omega := \mu_{\mathfrak{Z}_\omega(\mathfrak{A})}$ is called the central measure of $\omega$, and is
pseudosupported$^3$ by the factor states $F_{\mathfrak{A}}$. That is, the central measure $\mu_\omega$ of a
state $\omega$ decomposes its barycenter into different sectors. It is also seen that each state
$\omega$ has a unique central measure $\mu_\omega$ and is decomposed into sectors by $\mu_\omega$. The physical
interpretation of a sector will be discussed in the next subsection.



2.2 Composite Systems and Measuring Processes 

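A measuring process is a vital physical process to characterize quantum systems and
access actual situations. It is widely accepted that apparatuses interact with the system
under consideration and output values of observables. This fact is formulated here by a
composite system of the system and an apparatus.

Let $\mathfrak{A}$ be a C*-algebra of the system under consideration, $\omega$ a state on $\mathfrak{A}$, and
$(\pi_\omega, \mathcal{H}_\omega, \Omega_\omega)$ a GNS representation of $\mathfrak{A}$ with respect to $\omega$. We select an observable
$A = A^* \in \pi_\omega(\mathfrak{A})''$ to be measured and use $C_0(A) := \{f(A) \mid f \in C_0(\mathrm{Sp}(A))\}$ as the algebra of an
apparatus measuring $A$, where $\mathrm{Sp}(A) := \{\lambda \in \mathbb{C} \mid A - \lambda 1 \text{ is not invertible}\}\ (\subset \mathbb{R})$ is the
spectrum of $A$. Then the algebra of the composite system of the system and the apparatus is
defined as $\mathfrak{A} \otimes_m C_0(A)$.$^4$ The von Neumann algebra of this algebra in the representation
$\pi_\omega \otimes \mathrm{id}$ is calculated as

$$\{(\pi_\omega \otimes \mathrm{id})(\mathfrak{A} \otimes_m C_0(A))\}'' = \pi_\omega(\mathfrak{A})'' \otimes \mathcal{A},$$

where $\mathcal{A} = C_0(A)''$ is a von Neumann subalgebra of $B(\mathcal{H}_\omega)$, with center

$$\begin{aligned}
\mathfrak{Z}_{\pi_\omega \otimes \mathrm{id}}(\mathfrak{A} \otimes_m C_0(A)) &:= \{(\pi_\omega \otimes \mathrm{id})(\mathfrak{A} \otimes_m C_0(A))\}'' \cap \{(\pi_\omega \otimes \mathrm{id})(\mathfrak{A} \otimes_m C_0(A))\}' \\
&= (\pi_\omega(\mathfrak{A})'' \otimes \mathcal{A}) \cap (\pi_\omega(\mathfrak{A})'' \otimes \mathcal{A})' \\
&= (\pi_\omega(\mathfrak{A})'' \otimes \mathcal{A}) \cap (\pi_\omega(\mathfrak{A})' \otimes \mathcal{A}') \\
&= (\pi_\omega(\mathfrak{A})'' \cap \pi_\omega(\mathfrak{A})') \otimes (\mathcal{A} \cap \mathcal{A}') \\
&= \mathfrak{Z}_\omega(\mathfrak{A}) \otimes \mathcal{A}.
\end{aligned}$$

Therefore, the algebra of observables of the apparatus is contained in the center of the
algebra of the composite system. Assume, from now on, that the state $\omega$ of the system is
a factor state for simplicity.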

Next, let us consider the definition of measuring interactions and their physical meaning.
In this paper, measuring interactions are defined by automorphisms on the algebra
$\pi_\omega(\mathfrak{A})'' \otimes \mathcal{A}$ of the composite system. When we fix a standard form$^5$ of the von Neumann
algebra $\pi_\omega(\mathfrak{A})'' \otimes \mathcal{A}$, any automorphism $\alpha$ is unitarily implemented, i.e., there exists a
unitary operator $U$ such that $\alpha(A) = U^*AU$, $A \in \pi_\omega(\mathfrak{A})'' \otimes \mathcal{A}$. Owing to this result, we
can reproduce unitary operators on a Hilbert space as measuring interactions in the general
theory of measuring processes in the Hilbert space formalism. Let $\alpha$ be an automorphism
on a von Neumann algebra $\mathcal{M}$. We define $\alpha^* : E_{\mathcal{M}} \to E_{\mathcal{M}}$ by $(\alpha^*\omega)(B) = \omega(\alpha(B))$ for
$B \in \mathcal{M}$.

$^2$ See [7, Section 4.1].

$^3$ A measure $\mu$ on a Borel space $K$ is said to be pseudosupported by an arbitrary set $A \subset K$ if
$\mu(B) = 0$ for all Baire sets $B$ such that $B \cap A = \emptyset$.

$^4$ The injective C*-tensor product $\mathfrak{A}_1 \otimes_m \mathfrak{A}_2 = \mathfrak{A}_1 \otimes_{\min} \mathfrak{A}_2$ of two C*-algebras $\mathfrak{A}_1$ and $\mathfrak{A}_2$ is the
completion of the algebraic tensor product $\mathfrak{A}_1 \otimes_{\mathrm{alg}} \mathfrak{A}_2$ by the norm $\|\cdot\|_{\min}$ defined by
$\|C\|_{\min} = \sup \|(\pi_1 \otimes \pi_2)(C)\|$, $C \in \mathfrak{A}_1 \otimes_{\mathrm{alg}} \mathfrak{A}_2$, where $\pi_1$ and $\pi_2$ run over all
representations of $\mathfrak{A}_1$ and $\mathfrak{A}_2$, respectively. See [38] for details.

The most important and fundamental measuring interaction is that of ideal measurement
of $A$, denoted by $\alpha(W)$, which is to be combined with the concept of a neutral
position, vital for the description of the function of apparatuses [13]. A neutral position,
denoted by $m_{\mathcal{A}}$, is the "universal" initial state of the apparatus and corresponds to the
macroscopically stable position of the measuring pointer realized when the apparatus is
isolated (for the detailed mathematical formulation, see [13]). $\alpha(W)$ is an automorphism on
$\pi_\omega(\mathfrak{A})'' \otimes \mathcal{A}$ such that

$$\begin{aligned}
\omega_{(A,\alpha(W))}(B) &:= \alpha(W)^*(\tilde\omega \otimes m_{\mathcal{A}})((\pi_\omega \otimes \mathrm{id})(B)) \\
&= \int (\omega_a \otimes_m \delta_a)(B) \, P(A \in da \,\|\, \omega), \qquad (1)
\end{aligned}$$

for $B \in \mathfrak{A} \otimes_m C_0(A)$, where $P(A \in \Delta \,\|\, \omega) = \langle\Omega_\omega, E^A(\Delta)\Omega_\omega\rangle$, $E^A$ is the spectral measure of
$A$ such that $A = \int a \, dE^A(a)$, $\Delta \in \mathcal{B}(\mathbb{R})$, $\omega_a = \tilde\omega_a \circ \pi_\omega$ is such that $\tilde\omega_a(f(A)) = f(a)$, and
$\delta_a$ is the eigenstate on $C_0(A)$ for $a \in \mathrm{Sp}(A)$. The interaction $\alpha(W)$ of ideal measurement should
be regarded as a trigger of a perfect correlation [32] between the state of the system and
that of the apparatus.
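To make the probability $P(A \in \Delta \,\|\, \omega) = \langle\Omega_\omega, E^A(\Delta)\Omega_\omega\rangle$ appearing in Eq. (1) concrete, here is a minimal finite-dimensional sketch. It assumes, purely for illustration, that $A$ is diagonal in a fixed basis and that $\Omega_\omega$ is given by its components in that basis; the function name `born_distribution` is hypothetical.

```python
from collections import defaultdict

def born_distribution(eigenvalues, amplitudes):
    """Return {a: P(A = a)} for a diagonal observable A.

    eigenvalues[i] is the eigenvalue attached to the i-th basis vector and
    amplitudes[i] is the component of the state vector along it. The spectral
    projection E^A({a}) selects the basis vectors with eigenvalue a, so
    P(A = a) is the sum of |amplitude|^2 over those vectors (after normalizing).
    """
    norm2 = sum(abs(c) ** 2 for c in amplitudes)
    dist = defaultdict(float)
    for a, c in zip(eigenvalues, amplitudes):
        dist[a] += abs(c) ** 2 / norm2
    return dict(dist)

# A degenerate eigenvalue +1 (two basis vectors) and a simple eigenvalue -1.
eigs = [1.0, 1.0, -1.0]
omega = [0.5, 0.5j, 0.5 ** 0.5]
dist = born_distribution(eigs, omega)
```

In this toy setting each distinct eigenvalue labels one outcome of the apparatus, which mirrors the way eigenvalues index the states $\omega_a \otimes_m \delta_a$ in Eq. (1).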

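Then, Eq. (1) can be rewritten as

$$\omega_{(A,\alpha(W))}(B) = \int \rho(B) \, d\mu_{\omega_{(A,\alpha(W))}}(\rho), \qquad (2)$$

$$\mathrm{supp}\, \mu_{\omega_{(A,\alpha(W))}} = \{\omega_a \otimes_m \delta_a \mid a \in \mathrm{Sp}(A)\},$$

for $B \in \mathfrak{A} \otimes_m C_0(A)$. This is because the algebra $\mathfrak{A} \otimes_m C_0(A)$ of the composite system
has center $\mathbb{C}1 \otimes \mathcal{A}$, and the family of eigenvalues of an observable $A$ becomes an index
for classifying sectors. This result shows us that the central measure of a state of the
composite system after measurement plays the role of a probability measure appearing
in the Born rule [30]. We can generalize the above argument to any automorphism $\alpha$ on
$\pi_\omega(\mathfrak{A})'' \otimes \mathcal{A}$, and discuss the optimization of measurement:

$$\begin{aligned}
\omega_{(A,\alpha)}(B) &:= \alpha^*(\tilde\omega \otimes m_{\mathcal{A}})((\pi_\omega \otimes \mathrm{id})(B)) \\
&= \int \rho(B) \, d\mu_{\omega_{(A,\alpha)}}(\rho), \qquad (3)
\end{aligned}$$

$$\mathrm{supp}\, \mu_{\omega_{(A,\alpha)}} = \{\omega_{a,\alpha} \otimes_m \delta_a \mid a \in \mathrm{Sp}(A)\},$$

for $B \in \mathfrak{A} \otimes_m C_0(A)$, where, for each $a \in \mathrm{Sp}(A)$, $\omega_{a,\alpha}$, depending on $\alpha$, is the state of the
system when the value $a \in \mathrm{Sp}(A)$ is measured in an apparatus. Therefore, we use the
central measure of a state of the composite system after the measuring interaction. If $\omega$ above
is not a factor state, we should replace the central measure $\mu_{\omega_{(A,\alpha)}}$ by the subcentral measure
of $\omega_{(A,\alpha)}$ corresponding to the abelian von Neumann subalgebra $\mathbb{C}1 \otimes \mathcal{A}$.

$^5$ A standard form of a von Neumann algebra $\mathcal{M}$ on a Hilbert space $\mathcal{H}$ is a quadruple $(\mathcal{M}, \mathcal{H}, J, \mathcal{P})$
consisting of $\mathcal{M}$, $\mathcal{H}$, a unitary involution $J$, and a self-dual cone $\mathcal{P}$ in $\mathcal{H}$, which satisfy the following: (i)
$J\mathcal{M}J = \mathcal{M}'$; (ii) $JAJ = A^*$, $A \in \mathfrak{Z}(\mathcal{M})$; (iii) $J\xi = \xi$, $\xi \in \mathcal{P}$; (iv) $AJAJ\mathcal{P} \subset \mathcal{P}$, $A \in \mathcal{M}$. See [31].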

We close this section with a discussion of the concept of a sector. A sector can be
interpreted as a physical origin of macroscopic indicators of classification, and disjointness
among different sectors represents a "conditional" stability of macroscopic structures
[27], [26]. As a typical example, sectors in the composite system of the system under
consideration with an apparatus are classified by the eigenvalues of an observable to be
measured, as already discussed. We would like to emphasize that quantum measurement
theory requires searching for nontrivial intersectorial structure in each sector of the
system. On the other hand, non-factor states of the system correspond to physical
situations in which order parameters such as temperature are assumed before any measurement.
This case is often overlooked.



3 Hiai-Ohya-Tsukada Theorem and its Application 
3.1 Hiai-Ohya-Tsukada Theorem 

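We denote by $M_1(\Omega)$ the space of probability measures on a measurable space $(\Omega, \mathcal{F})$.
We define the relative entropy of the probability measure $\nu \in M_1(\Omega)$ with respect to
$\mu \in M_1(\Omega)$ as

$$D(\nu\|\mu) = \begin{cases} \displaystyle\int \log \frac{d\nu}{d\mu}(\omega) \, d\nu(\omega), & (\nu \ll \mu), \\ +\infty, & (\text{otherwise}). \end{cases} \qquad (4)$$

If there exists a measure $\sigma$ on $\Omega$ such that $\nu, \mu \ll \sigma$, $D(\nu\|\mu)$ is also denoted by $D(q\|p)$,
where $q := \dfrac{d\nu}{d\sigma}$ and $p := \dfrac{d\mu}{d\sigma}$.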
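For finite sample spaces, definition (4) reduces to a finite sum. The following minimal sketch assumes distributions are represented as dictionaries mapping outcomes to probabilities; the function name `relative_entropy` is hypothetical.

```python
import math

def relative_entropy(nu, mu):
    """D(nu || mu) for finite distributions given as dicts outcome -> probability.

    Returns math.inf when nu is not absolutely continuous with respect to mu,
    i.e. when nu charges an outcome that mu does not, matching the second case
    of definition (4).
    """
    total = 0.0
    for x, q in nu.items():
        if q == 0.0:
            continue  # the convention 0 * log 0 = 0
        p = mu.get(x, 0.0)
        if p == 0.0:
            return math.inf
        total += q * math.log(q / p)
    return total

uniform = {"a": 0.5, "b": 0.5}
point = {"a": 1.0}
d_point_uniform = relative_entropy(point, uniform)   # finite: point << uniform
d_uniform_point = relative_entropy(uniform, point)   # infinite: uniform is not << point
```

The asymmetry between the two calls illustrates why the relative entropy is only a quasi-distance.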

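Let us explain the formulations of relative entropy due to Araki and to Uhlmann
[H |42l [17]. Let $(\mathcal{M}, \mathcal{H}, J, \mathcal{P})$ be a standard form of a von Neumann algebra $\mathcal{M}$, and $\varphi, \psi$
be normal states on $\mathcal{M}$. There exist unique $\Phi, \Psi \in \mathcal{P}$ such that $\varphi(A) = \langle\Phi, A\Phi\rangle$,
$\psi(A) = \langle\Psi, A\Psi\rangle$ for all $A \in \mathcal{M}$. The operator $S_{\Phi,\Psi}$ with the domain
$\mathrm{Dom}(S_{\Phi,\Psi}) = \mathcal{M}\Psi + (1 - s^{\mathcal{M}'}(\Psi))\mathcal{H}$ is defined by

$$S_{\Phi,\Psi}(A\Psi + \Omega) = s^{\mathcal{M}}(\Psi)A^*\Phi, \quad A \in \mathcal{M}, \quad s^{\mathcal{M}'}(\Psi)\Omega = 0,$$

where $s^{\mathcal{M}}(\Psi)$ is the $\mathcal{M}$-support of $\Psi$, i.e., the minimal projection $E$ in $\mathcal{M}$ such that
$(1 - E)\Psi = 0$. $S_{\Phi,\Psi}$ is seen to be a closable operator. Then the relative modular operator
$\Delta_{\Phi,\Psi}$ is defined by $\Delta_{\Phi,\Psi} = S_{\Phi,\Psi}^*\overline{S_{\Phi,\Psi}}$. Let

$$\Delta_{\Phi,\Psi} = \int_0^\infty \lambda \, dE_{\Phi,\Psi}(\lambda)$$

be the spectral decomposition of $\Delta_{\Phi,\Psi}$. Araki's relative entropy $S(\varphi\|\psi)_{\mathrm{Araki}}$ is defined by

$$S(\varphi\|\psi)_{\mathrm{Araki}} = \begin{cases} \displaystyle\int_0^\infty \log\lambda \, d\langle\Phi, E_{\Phi,\Psi}(\lambda)\Phi\rangle, & (s(\varphi) \leq s(\psi)), \\ +\infty, & (\text{otherwise}), \end{cases}$$

where $s(\varphi)$ is the minimal projection $E$ in $\mathcal{M}$ such that $\varphi(1 - E) = 0$ and is called the
support of $\varphi$.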

We then define Uhlmann's relative entropy. For two seminorms $p$ and $q$ on a complex
linear space $\mathcal{L}$, the quadratical mean $QM(p, q)$ is defined by

$$QM(p, q)(x) = \sup_{\alpha \in S(p,q)} \alpha(x, x)^{1/2}, \quad x \in \mathcal{L},$$

where $S(p, q)$ is the set of all positive hermitian forms $\alpha$ on $\mathcal{L}$ satisfying
$|\alpha(x, y)| \leq p(x)q(y)$ for all $x, y \in \mathcal{L}$. A function $[0, 1] \ni t \mapsto p_t$ whose values are seminorms
on $\mathcal{L}$ is called a quadratical interpolation from $p$ to $q$ if it satisfies the following conditions:
(i) for each $x \in \mathcal{L}$ the function $t \mapsto p_t(x)$ is continuous; (ii) it satisfies the following
properties:

$$\begin{aligned}
p_t &= QM(p_{t_1}, p_{t_2}), \quad t = \frac{t_1 + t_2}{2}, \quad t_1, t_2 \in [0, 1], \\
p_{1/2} &= QM(p, q), \\
p_{t/2} &= QM(p, p_t), \quad t \in [0, 1], \\
p_{(1+t)/2} &= QM(p_t, q), \quad t \in [0, 1].
\end{aligned}$$

Furthermore, for each pair of positive hermitian forms $\alpha$ and $\beta$ there exists a unique function
$[0, 1] \ni t \mapsto QF_t(\alpha, \beta)$ whose values are positive hermitian forms on $\mathcal{L}$ such that the
function $p_t$ defined by $p_t(x) = QF_t(\alpha, \beta)(x, x)^{1/2}$ for each $x \in \mathcal{L}$ is the quadratical
interpolation from $\alpha(x, x)^{1/2}$ to $\beta(x, x)^{1/2}$. The relative entropy functional $S(\alpha\|\beta)(x)$ of $\alpha$
and $\beta$ is defined by

$$S(\alpha\|\beta)(x) = -\liminf_{t \to +0} \frac{QF_t(\alpha, \beta)(x, x) - \alpha(x, x)}{t}.$$

Let $\mathcal{A}$ be a *-algebra, and $\varphi, \psi$ be positive linear functionals on $\mathcal{A}$. Uhlmann's
relative entropy $S(\varphi\|\psi)_{\mathrm{Uhlmann}}$ is defined by

$$S(\varphi\|\psi)_{\mathrm{Uhlmann}} = S(\varphi^R\|\psi^L)(1),$$

where $\varphi^R$ and $\psi^L$ are the positive hermitian forms given by $\varphi^R(A, B) = \varphi(BA^*)$,
$\psi^L(A, B) = \psi(A^*B)$.
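The quadratical interpolation can be made explicit in the simplest possible case. The sketch below assumes $\mathcal{L} = \mathbb{C}$ (one-dimensional), so that every positive hermitian form is $\alpha(x, y) = a\,\bar{x}y$ with a coefficient $a > 0$; in that case the quadratical mean of two seminorms is their geometric mean and $QF_t(\alpha, \beta)$ has coefficient $a^{1-t}b^t$. The function names are hypothetical, and the limit in the relative entropy functional is approximated by a finite difference.

```python
import math

def qf(t, a, b):
    """Coefficient of QF_t(alpha, beta) on the one-dimensional space C,
    where alpha(x, y) = a * conj(x) * y and beta(x, y) = b * conj(x) * y."""
    return a ** (1.0 - t) * b ** t

def qm(p, q):
    """Quadratical mean of the seminorms p*|x| and q*|x| on C (geometric mean)."""
    return math.sqrt(p * q)

def seminorm(t, a, b):
    """Coefficient of p_t(x) = QF_t(alpha, beta)(x, x)^(1/2) at x = 1."""
    return math.sqrt(qf(t, a, b))

def entropy_functional(a, b, t=1e-6):
    """Finite-difference approximation of
    S(alpha || beta)(1) = -lim_{t -> +0} (QF_t(alpha, beta)(1, 1) - alpha(1, 1)) / t."""
    return -(qf(t, a, b) - a) / t

a, b = 0.7, 0.2
midpoint_gap = abs(seminorm(0.55, a, b) - qm(seminorm(0.2, a, b), seminorm(0.9, a, b)))
approx = entropy_functional(a, b)
exact = a * math.log(a / b)   # closed form of the limit in this scalar case
```

In this scalar case the functional reduces to $a \log(a/b)$, the familiar integrand of the classical relative entropy.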

In [17], the following three important theorems are proved:

Theorem 2. For any positive linear functionals $\varphi, \psi$ on a von Neumann algebra $\mathcal{M}$,

$$S(\varphi\|\psi)_{\mathrm{Uhlmann}} = S(\varphi\|\psi)_{\mathrm{Araki}}.$$

Therefore, we do not need to distinguish Araki's relative entropy from Uhlmann's,
and call them the quantum relative entropy. It is denoted by $S(\varphi\|\psi)$.

Theorem 3. Let $\varphi, \psi$ be positive linear functionals on a C*-algebra $\mathfrak{A}$ and $\pi$ be a
nondegenerate representation of $\mathfrak{A}$ on a Hilbert space. If there are normal extensions $\tilde\varphi, \tilde\psi$
of $\varphi, \psi$ to $\pi(\mathfrak{A})''$ such that $\varphi(A) = \tilde\varphi(\pi(A))$, $\psi(A) = \tilde\psi(\pi(A))$, then $S(\tilde\varphi\|\tilde\psi) = S(\varphi\|\psi)$.

We take as $\pi$ a GNS representation $\pi_{\varphi+\psi}$ corresponding to $\varphi + \psi$ in this paper. The
following theorem is our main concern in this section.

Theorem 4 (Hiai-Ohya-Tsukada Theorem [17]). Let $\varphi, \psi$ be states on $\mathfrak{A}$ with barycentric
measures $\mu, \nu$, respectively. If there exists a subcentral measure $m$ such that $\mu, \nu \ll m$,
then $S(\varphi\|\psi) = D(\mu\|\nu)$.

By this theorem we can calculate the quantum relative entropy as the
measure-theoretical relative entropy.
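3.2 An Application of Theorem 4

Let $\mathfrak{A}$ be a C*-algebra, $\omega_1, \omega_2$ states on $\mathfrak{A}$, and $(\pi_\omega, \mathcal{H}_\omega, \Omega_\omega)$ a GNS representation of $\mathfrak{A}$
with respect to $\omega = \omega_1 + \omega_2$. We define the set of measurements comparing $\omega_1$ with $\omega_2$
as follows:

$$M(\omega_1 \prec \omega_2) = \bigcup_{A = A^* \in \pi_\omega(\mathfrak{A})''} M(A; \omega_1 \prec \omega_2),$$

$$M(A; \omega_1 \prec \omega_2) = \{(A, \alpha) \in \pi_\omega(\mathfrak{A})'' \times \mathrm{Aut}(\pi_\omega(\mathfrak{A})'' \otimes \mathcal{A}) \mid \mathcal{A} = C_0(A)'', \ \mu_{\omega_{1,(A,\alpha)}} \ll \mu_{\omega_{2,(A,\alpha)}}\},$$

where

$$\omega_{j,(A,\alpha)} = \alpha^*(\tilde\omega_j \otimes m_{\mathcal{A}}) \circ (\pi_\omega \otimes \mathrm{id}),$$

and the C*-algebra of the composite system is $\mathfrak{A} \otimes_m C_0(A)$. In order to justify
the definition of $M(\omega_1 \prec \omega_2)$, we prepare the next lemma.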

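Lemma 1. Let $\omega_1, \omega_2$ be states on a C*-algebra $\mathfrak{A}$, $\alpha \in \mathrm{Aut}(\mathfrak{A})$. Then
$S(\omega_1\|\omega_2) = S(\alpha^*\omega_1\|\alpha^*\omega_2)$.

Proof. For a unital CP (completely positive) map $\Lambda$ on $\mathfrak{A}$, it holds that

$$S(\Lambda^*\omega_1\|\Lambda^*\omega_2) \leq S(\omega_1\|\omega_2),$$

which is a well-known fact [17], [33]. Thus,

$$S(\alpha^*\omega_1\|\alpha^*\omega_2) \leq S(\omega_1\|\omega_2).$$

Since $\alpha^{-1}$ is also an automorphism on $\mathfrak{A}$,

$$S(\omega_1\|\omega_2) = S((\alpha^{-1})^*\alpha^*\omega_1\|(\alpha^{-1})^*\alpha^*\omega_2) \leq S(\alpha^*\omega_1\|\alpha^*\omega_2).$$

The lemma is proved. □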
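If $M(\omega_1 \prec \omega_2) \neq \emptyset$, then for $(A, \alpha) \in M(\omega_1 \prec \omega_2)$,

$$\begin{aligned}
D(\mu_{\omega_{1,(A,\alpha)}}\|\mu_{\omega_{2,(A,\alpha)}}) &= S(\alpha^*(\tilde\omega_1 \otimes m_{\mathcal{A}}) \circ (\pi_\omega \otimes \mathrm{id}) \,\|\, \alpha^*(\tilde\omega_2 \otimes m_{\mathcal{A}}) \circ (\pi_\omega \otimes \mathrm{id})) \\
&= S(\alpha^*(\tilde\omega_1 \otimes m_{\mathcal{A}}) \,\|\, \alpha^*(\tilde\omega_2 \otimes m_{\mathcal{A}})) \\
&= S(\tilde\omega_1 \otimes m_{\mathcal{A}} \,\|\, \tilde\omega_2 \otimes m_{\mathcal{A}}) = S(\tilde\omega_1\|\tilde\omega_2) = S(\omega_1\|\omega_2).
\end{aligned}$$

The proof of this equality is essentially based on the use of $(A, \alpha) \in M(\omega_1 \prec \omega_2)$, Theorem
2 and Lemma 1. The central measures $\mu_{\omega_{1,(A,\alpha)}}, \mu_{\omega_{2,(A,\alpha)}}$ of the states $\omega_{1,(A,\alpha)}, \omega_{2,(A,\alpha)}$,
respectively, are specified by measured data, since the Born rule is equivalent to the central
measure of the state of the composite system after measurement. We conclude that we can
examine the quantum relative entropy $S(\omega_1\|\omega_2)$ of $\omega_1$ with respect to $\omega_2$ statistically, whenever
$M(\omega_1 \prec \omega_2) \neq \emptyset$.

By the way, we may conjecture the following statement:

Conjecture 1. If $S(\omega_1\|\omega_2) < \infty$, then $M(\omega_1 \prec \omega_2) \neq \emptyset$.

This is, of course, not trivial mathematically. The strong condition $S(\omega_1\|\omega_2) < \infty$
should be regarded as a physically affirmative statement that $\omega_1$ and $\omega_2$ are so similar
that the former can be compared with the latter by the quantum relative entropy
$S(\omega_1\|\omega_2)$. Thus it is expected that the quantum relative entropy corresponds to the
measure-theoretical relative entropy using specific probability measures. In the rest of
the paper, we assume a condition such as $M(\omega_1 \prec \omega_2) \neq \emptyset$.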






4 Large Deviation Type Estimate 



In this section, two estimates based on the large deviation principle are discussed. One of
them is Stein's lemma and the other is Sanov's theorem. These are regarded as the standard
procedures for giving a statistical meaning to the relative entropy. We show here that in
quantum systems the quantum relative entropy has the same meaning as the
measure-theoretical relative entropy has.



4.1 Another Version of Quantum Stein's Lemma 

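Assume that $M(\omega_1 \prec \omega_2) \neq \emptyset$, and that for $(A, \alpha) \in M(\omega_1 \prec \omega_2)$ the Radon-Nikodym
derivative $\dfrac{d\mu_{\omega_{1,(A,\alpha)}}}{d\mu_{\omega_{2,(A,\alpha)}}}$ is strictly positive. We are now ready for hypothesis testing.

Definition 2. A (decision) test $T$ is a sequence of measurable functions
$T_n : (\mathrm{supp}\, \mu_{\omega_{2,(A,\alpha)}})^n \to \{0, 1\}$ with interpretation $\{T_n = 0\} = \{H_0 \text{ is accepted}\}$ and
$\{T_n = 1\} = \{H_1 \text{ is accepted}\}$. If $T = (T_n)$, we do not distinguish $T$ and $T_n$.

We define the error probabilities

$$\alpha_n^{(A,\alpha)}(T_n) = P_1^{(A,\alpha)}(T_n = 1) = P_1^{(A,\alpha)}(H_0 \text{ is rejected}),$$
$$\beta_n^{(A,\alpha)}(T_n) = P_2^{(A,\alpha)}(T_n = 0) = P_2^{(A,\alpha)}(H_1 \text{ is rejected}),$$

of the first kind and of the second kind, where $P_j^{(A,\alpha)}$ is the countable product probability
measure of $\mu_{\omega_{j,(A,\alpha)}}$ for $j = 1, 2$. Then we define the log-likelihood ratio

$$X_j(\rho) = \log \frac{d\mu_{\omega_{2,(A,\alpha)}}}{d\mu_{\omega_{1,(A,\alpha)}}}(\rho_j)$$

for $\rho = (\rho_1, \rho_2, \cdots)$, and the normalized log-likelihood ratio

$$S_n = \frac{1}{n} \sum_{j=1}^n X_j.$$

A test $T^\eta = (T_n^\eta : (\mathrm{supp}\, \mu_{\omega_{2,(A,\alpha)}})^n \to \{0, 1\})$ defined by

$$T_n^\eta = \begin{cases} 0, & (S_n \leq \eta_n), \\ 1, & (\text{otherwise}), \end{cases}$$

is called the Neyman-Pearson test, and denoted also by $T^\eta = (T_n^\eta)$, where $\eta = (\eta_n)$ is
a real sequence.

Lemma 2. For any $0 < \gamma < 1$, there exists $\eta = (\eta_n)$ such that $\alpha_n^{(A,\alpha)}(T_n^\eta) \leq \gamma$, and
any other test $T = (T_n)$ satisfying $\alpha_n^{(A,\alpha)}(T_n) \leq \gamma$ must satisfy $\beta_n^{(A,\alpha)}(T_n) \geq \beta_n^{(A,\alpha)}(T_n^\eta)$.

See [HI EQ] for the proof. It is well known that the two error probabilities cannot be
minimized simultaneously. Thus we consider the next quantity:

$$\beta_n^{(A,\alpha)}(\varepsilon) = \inf\{\beta_n^{(A,\alpha)}(T_n) \mid T_n : \text{test}, \ \alpha_n^{(A,\alpha)}(T_n) < \varepsilon\}.$$

This is the infimum of $\beta_n^{(A,\alpha)}(T_n)$ among all tests $T_n$ with $\alpha_n^{(A,\alpha)}(T_n) < \varepsilon$. It is seen
by Lemma 2 that a decision test attaining the infimum is the Neyman-Pearson test
$T^\eta = (T_n^\eta)$, where $\eta = (\eta_n)$ is determined by $\varepsilon$. The following theorem is then proved,
which is another version of quantum Stein's lemma.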
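Theorem 5. For any $0 < \varepsilon < 1$,

$$\lim_{n \to \infty} \frac{1}{n} \log \beta_n^{(A,\alpha)}(\varepsilon) = -S(\omega_1\|\omega_2). \qquad (5)$$

Proof. By [10, Lemma 3.4.7 and Exercise 3.4.17],

$$\lim_{n \to \infty} \frac{1}{n} \log \beta_n^{(A,\alpha)}(\varepsilon) = -D(\mu_{\omega_{1,(A,\alpha)}}\|\mu_{\omega_{2,(A,\alpha)}}).$$

Then, $D(\mu_{\omega_{1,(A,\alpha)}}\|\mu_{\omega_{2,(A,\alpha)}}) = S(\omega_1\|\omega_2)$. □

This theorem holds for all C*-algebras, that is, for all quantum systems, if two states
$\varphi, \psi$ are comparable with each other, i.e., $M(\varphi \prec \psi) \neq \emptyset$ or $M(\psi \prec \varphi) \neq \emptyset$. Therefore,
we expect that the asymptotic theory of classical hypothesis testing is applicable to the
quantum case.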
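The classical statement behind Theorem 5 can be checked numerically. The sketch below is an illustration under simplifying assumptions that are not in the paper: the two measures are replaced by Bernoulli distributions with parameters `p` (null hypothesis) and `q` (alternative) with `p > q`, and only deterministic threshold tests on the number of ones are considered instead of the general (possibly randomized) Neyman-Pearson test. The function names are hypothetical.

```python
import math

def kl_bernoulli(p, q):
    """Relative entropy D(Bernoulli(p) || Bernoulli(q)) for 0 < p, q < 1."""
    return p * math.log(p / q) + (1 - p) * math.log((1 - p) / (1 - q))

def log_binom_pmf(n, k, p):
    """log of the Binomial(n, p) probability of k successes."""
    return (math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
            + k * math.log(p) + (n - k) * math.log(1 - p))

def stein_rate(n, p, q, eps):
    """-(1/n) log of the smallest type-II error among threshold tests
    'accept H0 iff K >= c' whose type-I error P_p(K < c) is at most eps.
    Assumes p > q, so the likelihood ratio is increasing in K."""
    cdf, c = 0.0, 0
    for k in range(n + 1):
        pk = math.exp(log_binom_pmf(n, k, p))
        if cdf + pk > eps:      # raising the threshold further would break the constraint
            break
        cdf += pk
        c = k + 1
    # type-II error: probability under q of accepting H0, via log-sum-exp
    logs = [log_binom_pmf(n, k, q) for k in range(c, n + 1)]
    m = max(logs)
    log_beta = m + math.log(sum(math.exp(l - m) for l in logs))
    return -log_beta / n

target = kl_bernoulli(0.6, 0.3)
rate_small = stein_rate(200, 0.6, 0.3, 0.1)
rate_large = stein_rate(5000, 0.6, 0.3, 0.1)
```

As `n` grows, the computed rate approaches `target`, the relative entropy of the null distribution with respect to the alternative, which is the role played by $S(\omega_1\|\omega_2)$ in Eq. (5).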



4.2 A Quantum Version of Sanov's Theorem 

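Let $\mathfrak{A}$ be a C*-algebra, $\omega$ be a state on $\mathfrak{A}$, and $(\pi_\omega, \mathcal{H}_\omega, \Omega_\omega)$ be a GNS representation of $\mathfrak{A}$
with respect to $\omega$. We use $\mathfrak{A} \otimes_m C_0(A)$, where $A = A^* \in \pi_\omega(\mathfrak{A})''$, as the algebra of the
composite system of the system and an apparatus measuring $A$, and $\alpha \in \mathrm{Aut}(\pi_\omega(\mathfrak{A})'' \otimes \mathcal{A})$,
$\mathcal{A} = C_0(A)''$, as a measuring interaction.

We denote by $\mathcal{B}^w(M_1(E_{\mathfrak{A} \otimes_m C_0(A)}))$ the Borel $\sigma$-field on $M_1(E_{\mathfrak{A} \otimes_m C_0(A)})$ generated by
the weak topology$^6$. For any $\rho = (\rho_1, \rho_2, \cdots) \in (\mathrm{supp}\, \mu_{\omega_{(A,\alpha)}})^{\mathbb{N}}$ and $\Delta \in \mathcal{B}(\mathrm{supp}\, \mu_{\omega_{(A,\alpha)}})$,
we define random variables $Y_j(\rho) = \rho_j$ for $j = 1, 2, \cdots$, the empirical measures

$$L_n^{(A,\alpha)}(\rho, \Delta) = \frac{1}{n} \sum_{j=1}^n \delta_{Y_j(\rho)}(\Delta), \qquad (6)$$

and

$$Q_n^{(A,\alpha)}(\Gamma) = P^{(A,\alpha)}(L_n^{(A,\alpha)} \in \Gamma), \qquad (7)$$

for $\Gamma \in \mathcal{B}^w(M_1(E_{\mathfrak{A} \otimes_m C_0(A)}))$ such that $\{L_n^{(A,\alpha)} \in \Gamma\}$ is measurable, where $P^{(A,\alpha)}$ denotes the
countable product measure of $\mu_{\omega_{(A,\alpha)}}$. Then we state the main theorem in this subsection,
an earlier version of which has appeared in [29].

Theorem 6. $Q_n^{(A,\alpha)}$ satisfies the large deviation principle with the good rate function
$S(b(\cdot)\|\omega_{(A,\alpha)})$:

$$-S(b(\Gamma^\circ)\|\omega_{(A,\alpha)}) \leq \liminf_{n \to \infty} \frac{1}{n} \log Q_n^{(A,\alpha)}(\Gamma) \leq \limsup_{n \to \infty} \frac{1}{n} \log Q_n^{(A,\alpha)}(\Gamma) \leq -S(b(\overline{\Gamma})\|\omega_{(A,\alpha)}) \qquad (8)$$

for $\Gamma \in \mathcal{B}^w(M_1(E_{\mathfrak{A} \otimes_m C_0(A)}))$ such that $\{L_n^{(A,\alpha)} \in \Gamma\}$ is measurable, where

$$S(b(\Gamma)\|\omega_{(A,\alpha)}) := \inf\{S(b(\nu)\|\omega_{(A,\alpha)}) \mid \nu \in \Gamma, \ \nu \ll \mu_{\omega_{(A,\alpha)}}\}.$$

In the case that, for $\Gamma \in \mathcal{B}^w(M_1(E_{\mathfrak{A} \otimes_m C_0(A)}))$, $\{\nu \in \Gamma^\circ \mid \nu \ll \mu_{\omega_{(A,\alpha)}}\}$ or
$\{\nu \in \overline{\Gamma} \mid \nu \ll \mu_{\omega_{(A,\alpha)}}\}$ is empty, $S(b(\Gamma^\circ)\|\omega_{(A,\alpha)})$ and $S(b(\overline{\Gamma})\|\omega_{(A,\alpha)})$ are defined as infinity,
respectively.

$^6$ See [10] and [11, p. 261].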
Proof. This theorem is easily proved by [10, Sanov's Theorem] and Theorem 4. □
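The classical content of a Sanov-type bound can be checked numerically in the simplest case. The sketch below is an illustration under assumptions that are not in the paper: the data are i.i.d. Bernoulli($p$) samples, and the set $\Gamma$ consists of all distributions on $\{0, 1\}$ whose weight on $1$ is at least $a > p$, so that the infimum of the relative entropy over $\Gamma$ is attained at Bernoulli($a$). The function names are hypothetical.

```python
import math

def kl_bernoulli(a, p):
    """Relative entropy D(Bernoulli(a) || Bernoulli(p)) for 0 < a, p < 1."""
    return a * math.log(a / p) + (1 - a) * math.log((1 - a) / (1 - p))

def log_binom_pmf(n, k, p):
    """log of the Binomial(n, p) probability of k successes."""
    return (math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
            + k * math.log(p) + (n - k) * math.log(1 - p))

def sanov_rate(n, p, a):
    """-(1/n) log P(L_n in Gamma), where L_n is the empirical measure of n
    i.i.d. Bernoulli(p) samples and Gamma = {nu : nu({1}) >= a}, with a > p."""
    k0 = math.ceil(a * n - 1e-9)          # smallest count with empirical mean >= a
    logs = [log_binom_pmf(n, k, p) for k in range(k0, n + 1)]
    m = max(logs)
    log_prob = m + math.log(sum(math.exp(l - m) for l in logs))  # log-sum-exp
    return -log_prob / n

rate_function = kl_bernoulli(0.7, 0.5)
rate_200 = sanov_rate(200, 0.5, 0.7)
rate_2000 = sanov_rate(2000, 0.5, 0.7)
```

The computed decay rate approaches `rate_function` as `n` grows, which is the classical counterpart of the role played by $S(b(\cdot)\|\omega_{(A,\alpha)})$ in Eq. (8).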



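If $\mathfrak{A} \otimes_m C_0(A)$ is separable, then $E_{\mathfrak{A} \otimes_m C_0(A)}$ becomes a compact metric space whose
metric is defined by

$$d(\omega_1, \omega_2) = \sum_{j=1}^\infty \frac{1}{2^j} \frac{|\omega_1(A_j) - \omega_2(A_j)|}{\|A_j\|},$$

where the set $\{A_j \neq 0 \mid j \in \mathbb{N}\}$ is a dense subset of $\mathfrak{A} \otimes_m C_0(A)$. In this case, for all
$\Gamma \in \mathcal{B}^w(M_1(E_{\mathfrak{A} \otimes_m C_0(A)}))$, $\{L_n^{(A,\alpha)} \in \Gamma\}$ is measurable. It is known that
$\mathcal{B}^w(M_1(E_{\mathfrak{A} \otimes_m C_0(A)}))$ is equal to $\mathcal{B}^{cy}(M_1(E_{\mathfrak{A} \otimes_m C_0(A)}))$ if $\mathfrak{A} \otimes_m C_0(A)$ is separable, where
$\mathcal{B}^{cy}(M_1(E_{\mathfrak{A} \otimes_m C_0(A)}))$ is defined as follows. We denote by $B(E_{\mathfrak{A} \otimes_m C_0(A)})$ the vector space of all
bounded Borel measurable functions on $E_{\mathfrak{A} \otimes_m C_0(A)}$. For $\phi \in B(E_{\mathfrak{A} \otimes_m C_0(A)})$, let
$\tau_\phi : M_1(E_{\mathfrak{A} \otimes_m C_0(A)}) \to \mathbb{R}$ be defined by $\tau_\phi(\nu) = \langle\phi, \nu\rangle = \int \phi \, d\nu$. We denote by
$\mathcal{B}^{cy}(M_1(E_{\mathfrak{A} \otimes_m C_0(A)}))$ the $\sigma$-field of cylinder sets on $M_1(E_{\mathfrak{A} \otimes_m C_0(A)})$, i.e., the smallest
$\sigma$-field that makes all $\{\tau_\phi\}$ measurable$^7$.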


When one of the measures $\nu$ attaining the infimum of the upper or lower bound is an
element of the set $\{\nu \mid b(\nu) = \psi_{(A,\alpha)}, \ \psi \in E_{\mathfrak{A}} : \pi_\omega\text{-normal}\}$, there exists at
least one state $\psi$ such that $S(b(\nu)\|\omega_{(A,\alpha)}) = S(\psi\|\omega)$. We can therefore reach the state of
the system by the use of measured data.

Remark 1. Optimization of measurement is not considered in this subsection. The setting
of Sanov's theorem requires only the knowledge of the methods for collecting and
accumulating data. As a result, we should use a measurement $(A, \alpha)$ which has been analyzed in
detail in the quantum system under consideration.

5 Quantum Model Selection 

First, we define a (parametric) model of states. 

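Definition 3. A family of states $\{\omega_\theta \mid \theta \in \Theta\}$ on $\mathfrak{A}$ is called a (statistical) model if it
satisfies the following conditions:

(i) $\Theta$ is a compact subset of $\mathbb{R}^d$ ($d \in \mathbb{N}$);

(ii) there exist $\theta_1, \cdots, \theta_d \in \Theta$ such that, for every $\theta \in \Theta$, $\omega_\theta$ is $\pi$-normal, where
$\pi = \pi_{\omega_{\theta_1} + \cdots + \omega_{\theta_d}}$.

Assume that each model in this section consists of factor states for simplicity. A predictive
state is a quantum version of a predictive (probability) distribution and is a function,
constructed from a model of states, from data into states. In this section, our purpose is
to define a quantum version of information criteria in order to choose the best predictive
state from many models. We then define the following measurement.

Definition 4. For $N \in \mathbb{N}$, $N$ different models $\{\omega_{1,\theta_1} \mid \theta_1 \in \Theta_1\}, \cdots, \{\omega_{N,\theta_N} \mid \theta_N \in \Theta_N\}$ are
(predictively) comparable if they satisfy the following conditions:

(i) there exists $\varphi \in E_{\mathfrak{A}}$ such that $M(\varphi \prec \{\omega_{j,\theta_j}\}_{j=1,\cdots,N}) \neq \emptyset$;

(ii) for $(A, \alpha) \in M(\varphi \prec \{\omega_{j,\theta_j}\}_{j=1,\cdots,N})$,

(ii-1) there exists a subcentral measure $m$ on $E_{\mathfrak{A} \otimes_m C_0(A)}$ such that $\mu_{j,\theta_j}^{(A,\alpha)} \ll m$
($j = 1, 2, \cdots, N$), where $\mu_{j,\theta_j}^{(A,\alpha)}$ is the central measure of $\omega_{j,\theta_j,(A,\alpha)}$ ($j = 1, 2, \cdots, N$);

(ii-2) the support of $\mu_{j,\theta_j}^{(A,\alpha)}$ is independent of $\theta_j \in \Theta_j$.

The triplet $(\varphi, (A, \alpha), m)$ is called a predictive measurement for
$\{\omega_{1,\theta_1} \mid \theta_1 \in \Theta_1\}, \cdots, \{\omega_{N,\theta_N} \mid \theta_N \in \Theta_N\}$.

$^7$ See [11, p. 263].

It is easily seen that $\{\omega_{1,\theta_1} \mid \theta_1 \in \Theta_1\}$, $\{\omega_{2,\theta_2} \mid \theta_2 \in \Theta_2\}$ and $\{\omega_{3,\theta_3} \mid \theta_3 \in \Theta_3\}$ are not
necessarily comparable, even if $\{\omega_{1,\theta_1} \mid \theta_1 \in \Theta_1\}$ and $\{\omega_{2,\theta_2} \mid \theta_2 \in \Theta_2\}$ are comparable and
so are $\{\omega_{2,\theta_2} \mid \theta_2 \in \Theta_2\}$ and $\{\omega_{3,\theta_3} \mid \theta_3 \in \Theta_3\}$.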

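Let $\{\omega_{1,\theta_1} \mid \theta_1 \in \Theta_1\}, \cdots, \{\omega_{N,\theta_N} \mid \theta_N \in \Theta_N\}$ be predictively comparable and
$(\varphi, (A, \alpha), m)$ a predictive measurement for them. For $j = 1, \cdots, N$, we define

$$\omega_{j,\theta_j,(A,\alpha)} = \alpha^*(\tilde\omega_{j,\theta_j} \otimes m_{\mathcal{A}}) \circ (\pi \otimes \mathrm{id}),$$

and put $p_{j,\theta_j}^{(A,\alpha)}(\rho) = \dfrac{d\mu_{j,\theta_j}^{(A,\alpha)}}{dm}(\rho)$. We denote by $\rho^n = \{\rho_1, \rho_2, \cdots, \rho_n\}$ the set of states
consisting of $n$ elements. Then we define a maximum likelihood predictive state $\omega_{j,\hat\theta_j(\rho^n)}$,
where $\hat\theta_j(\rho^n) = \underset{\theta_j \in \Theta_j}{\mathrm{argmax}} \prod_{i=1}^n p_{j,\theta_j}^{(A,\alpha)}(\rho_i)$ for $\rho^n = \{\rho_1, \rho_2, \cdots, \rho_n\}$, and a Bayesian escort
predictive state

$$\omega_{j,\pi_j}^{\beta}(B) = \frac{\displaystyle\int \omega_{j,\theta_j}(B) \prod_{i=1}^n p_{j,\theta_j}^{(A,\alpha)}(\rho_i)^\beta \, \pi_j(\theta_j) \, d\theta_j}{\displaystyle\int \prod_{i=1}^n p_{j,\theta_j}^{(A,\alpha)}(\rho_i)^\beta \, \pi_j(\theta_j) \, d\theta_j},$$

for $j = 1, 2, \cdots, N$, where $\pi_j(\theta_j)$ is a probability density on $\Theta_j$ and $\beta > 0$. It then holds
that

$$\omega_{j,\pi_j,(A,\alpha)}^{\beta} = \int \rho \; p_{j,\pi_j}^{\beta}(\rho) \, dm(\rho),$$

where

$$p_{j,\pi_j}^{\beta}(\rho) = \frac{\displaystyle\int p_{j,\theta_j}^{(A,\alpha)}(\rho) \prod_{i=1}^n p_{j,\theta_j}^{(A,\alpha)}(\rho_i)^\beta \, \pi_j(\theta_j) \, d\theta_j}{\displaystyle\int \prod_{i=1}^n p_{j,\theta_j}^{(A,\alpha)}(\rho_i)^\beta \, \pi_j(\theta_j) \, d\theta_j}.$$

Furthermore, we can show the following equalities by Theorem 4:

$$S(\varphi\|\omega_{j,\hat\theta_j(\rho^n)}) = D(q\|p_{j,\hat\theta_j(\rho^n)}^{(A,\alpha)}), \qquad S(\varphi\|\omega_{j,\pi_j}^{\beta}) = D(q\|p_{j,\pi_j}^{\beta}),$$

where $q(\rho) = \dfrac{d\mu_{\varphi_{(A,\alpha)}}}{dm}(\rho)$. Thus we can define information criteria for quantum states,
owing to the discussion on their formulation for probability distributions [21 HU |l5l |16].
Information criteria for quantum states, such as

$$\mathrm{AIC}_j = -\frac{1}{n} \sum_{i=1}^n \log p_{j,\hat\theta_j(\rho^n)}^{(A,\alpha)}(\rho_i) + \frac{d_j}{n},$$

$$\mathrm{WAIC}_j = -\frac{1}{n} \sum_{i=1}^n \log p_{j,\pi_j}^{\beta}(\rho_i) + \frac{\beta}{n} V_j^{(A,\alpha)},$$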
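are defined for $j = 1, \cdots, N$, where $d_j = \dim \Theta_j$, $0 < \beta < \infty$, and the functional variance
$V_j^{(A,\alpha)}$ is defined by

$$V_j^{(A,\alpha)} = \sum_{i=1}^n \left\{ \left\langle \left(\log p_{j,\theta_j}^{(A,\alpha)}(\rho_i)\right)^2 \right\rangle_{j,\pi_j}^{\beta} - \left( \left\langle \log p_{j,\theta_j}^{(A,\alpha)}(\rho_i) \right\rangle_{j,\pi_j}^{\beta} \right)^2 \right\}. \qquad (9)$$

The a posteriori mean $\langle\cdot\rangle_{j,\pi_j}^{\beta}$ in Eq. (9) is defined by

$$\langle G(\theta_j) \rangle_{j,\pi_j}^{\beta} = \frac{\displaystyle\int G(\theta_j) \prod_{i=1}^n p_{j,\theta_j}^{(A,\alpha)}(\rho_i)^\beta \, \pi_j(\theta_j) \, d\theta_j}{\displaystyle\int \prod_{i=1}^n p_{j,\theta_j}^{(A,\alpha)}(\rho_i)^\beta \, \pi_j(\theta_j) \, d\theta_j} \qquad (10)$$

for a given function $G(\theta_j)$ on $\Theta_j$. We can choose the best model from
$\{\omega_{1,\theta_1} \mid \theta_1 \in \Theta_1\}, \cdots, \{\omega_{N,\theta_N} \mid \theta_N \in \Theta_N\}$ by information criteria in the setting of a predictive
measurement for them. Furthermore, it is clear from the above discussion that the accuracy
of the estimation for quantum states is the same as that for probability measures in the
classical case. Thus we have demonstrated the validity of the use of information criteria
for models of quantum states. On the other hand, we can conclude that what makes
applications of information criteria to quantum states difficult is the question of whether
measurements which can compare all models of states under consideration exist or not.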
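To illustrate how a criterion of the form "average negative log-likelihood plus $d_j/n$" is used for model selection, here is a minimal classical sketch. It is an assumption-laden toy example and not the paper's quantum setting: the measured data are taken to be coin flips, model 1 is a fair coin with no free parameter ($d = 0$), and model 2 is a Bernoulli($\theta$) family with $\theta$ fitted by maximum likelihood ($d = 1$). All function names are hypothetical.

```python
import math

def mean_neg_log_lik(heads, n, p):
    """-(1/n) * sum_i log p(x_i) for n coin flips containing `heads` ones,
    evaluated under Bernoulli(p) with 0 < p < 1."""
    return -(heads * math.log(p) + (n - heads) * math.log(1.0 - p)) / n

def aic_fixed(heads, n):
    """Model 1: fair coin, no free parameter (d = 0)."""
    return mean_neg_log_lik(heads, n, 0.5)

def aic_free(heads, n):
    """Model 2: Bernoulli(theta), theta fitted by maximum likelihood (d = 1).
    Requires 0 < heads < n so that the fitted theta lies strictly inside (0, 1)."""
    theta_hat = heads / n
    return mean_neg_log_lik(heads, n, theta_hat) + 1.0 / n

def select(heads, n):
    """Return the name of the model with the smaller criterion value."""
    return "fixed" if aic_fixed(heads, n) <= aic_free(heads, n) else "free"

nearly_fair = select(52, 100)   # data close to a fair coin
biased = select(70, 100)        # data clearly away from a fair coin
```

The penalty term $d/n$ is what prevents the larger model from winning automatically: its fitted likelihood is never worse, so without the penalty it would always be chosen.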

6 On the Quantum $\alpha$-Divergence

We discuss the quantum $\alpha$-divergence and its application here.

6.1 The $\alpha$-Version of the Hiai-Ohya-Tsukada Theorem

We define the a-divergence 

f D{u\\^x), (a = +l). 



y) = < 



dm 



dfi \ f dv \ 



dm J \ dm J 



(|a|<l), 
[ D{^i\\v), (a = -l), 



for probability measures /i, on a measurable space J-") which are absolutely continuous 
with respect to a measure m on J-"), Uhlmann's a-divergence 



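$$S^{(\alpha)}(\varphi\|\psi)_{\mathrm{Uhlmann}} = \begin{cases} S(\psi\|\varphi)_{\mathrm{Uhlmann}}, & (\alpha = +1), \\ \dfrac{4}{1-\alpha^2}\left(1 - QF_{\frac{1+\alpha}{2}}(\varphi^L, \psi^R)(1,1)\right), & (|\alpha| < 1), \\ S(\varphi\|\psi)_{\mathrm{Uhlmann}}, & (\alpha = -1), \end{cases}$$

for states $\varphi, \psi$ on a *-algebra, and Araki's α-divergence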



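$$S^{(\alpha)}(\varphi\|\psi)_{\mathrm{Araki}} = \begin{cases} S(\psi\|\varphi)_{\mathrm{Araki}}, & (\alpha = +1), \\ \dfrac{4}{1-\alpha^2}\left(1 - \left(\Psi, \Delta_{\Phi,\Psi}^{\frac{1-\alpha}{2}} \Psi\right)\right), & (|\alpha| < 1), \\ S(\varphi\|\psi)_{\mathrm{Araki}}, & (\alpha = -1), \end{cases}$$

for normal states $\varphi = (\Phi, \cdot\, \Phi)$, $\psi = (\Psi, \cdot\, \Psi)$ on a σ-finite von Neumann algebra $\mathcal{M}$, where
$\Phi, \Psi$ are elements of a self-dual cone of a standard form. It is easily checked that

$$S^{(\alpha)}(\mu\|\nu)_{\mathrm{Araki}} = D^{(\alpha)}(\mu\|\nu),$$

for $\mu, \nu \ll m$ on a von Neumann algebra $L^{\infty}(\Omega, m)$.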
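In the commutative case the definition can be checked numerically. The sketch below, which assumes finite probability vectors with strictly positive entries and uses the hypothetical helper names `kl` and `alpha_divergence`, evaluates $D^{(\alpha)}(\mu\|\nu)$ and illustrates that the $|\alpha| < 1$ branch approaches the relative entropies of the $\alpha = \pm 1$ branches.

```python
import numpy as np

def kl(p, q):
    """Relative entropy D(p||q) for strictly positive probability vectors."""
    return float(np.sum(p * np.log(p / q)))

def alpha_divergence(p, q, alpha):
    """Classical alpha-divergence D^(alpha)(p||q) with the convention used above."""
    if alpha == 1:
        return kl(q, p)          # D(nu||mu)
    if alpha == -1:
        return kl(p, q)          # D(mu||nu)
    s = np.sum(p ** ((1.0 - alpha) / 2.0) * q ** ((1.0 + alpha) / 2.0))
    return float(4.0 / (1.0 - alpha ** 2) * (1.0 - s))

p = np.array([0.2, 0.5, 0.3])
q = np.array([0.4, 0.4, 0.2])
for a in (-1, -0.999, 0.0, 0.999, 1):
    print(a, alpha_divergence(p, q, a))
```

The value at $\alpha = 0$ is symmetric in the two arguments, and exchanging the arguments corresponds to replacing $\alpha$ by $-\alpha$.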

The following three theorems are the α-analogues of the theorems in subsection 3.1. Proofs
are given in Appendix A.

Theorem 7. For any positive linear functionals $\varphi, \psi$ on a von Neumann algebra $\mathcal{M}$,

$$S^{(\alpha)}(\varphi\|\psi)_{\mathrm{Uhlmann}} = S^{(\alpha)}(\varphi\|\psi)_{\mathrm{Araki}}.$$

By this theorem, it is not necessary to distinguish Araki's α-divergence from Uhlmann's
one. We call them the quantum α-divergence, and denote them by $S^{(\alpha)}(\varphi\|\psi)$.

Theorem 8. Let $\varphi, \psi$ be positive linear functionals on a C*-algebra $\mathfrak{A}$ and $\pi$ be a nondegenerate
representation of $\mathfrak{A}$ on a Hilbert space. If there are the normal extensions $\tilde{\varphi}, \tilde{\psi}$ of
$\varphi, \psi$ to $\pi(\mathfrak{A})''$ such that $\varphi(A) = \tilde{\varphi}(\pi(A))$, $\psi(A) = \tilde{\psi}(\pi(A))$, then $S^{(\alpha)}(\varphi\|\psi) = S^{(\alpha)}(\tilde{\varphi}\|\tilde{\psi})$.

Theorem 9. Let $\varphi, \psi$ be states on $\mathfrak{A}$ with barycentric measures $\mu, \nu$, respectively. If there
exists a subcentral measure $m$ such that $\mu, \nu \ll m$, then $S^{(\alpha)}(\varphi\|\psi) = D^{(\alpha)}(\mu\|\nu)$.

6.2 A Quantum Version of Chernoff Bound

We use the same notation as in subsection 3.2.

M{oouuj2)= [j M{A;ui,U2), 

A=A*e7r„(2l)" 

M{A;couco2) = {{A, a) E 7r^(2l)" x AutiTr^^)" ® A)\ A = Co(A)", 
l^'^i.(A,c)^l^oj2,(A,c.) < 3m : a subcentral measure}, 

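For $(\mathcal{A}, \alpha) \in M(\omega_1, \omega_2)$, we define $\mathcal{R}^{(\mathcal{A},\alpha)}(T_n) = \pi_1 \alpha_n^{(\mathcal{A},\alpha)}(T_n) + \pi_2 \beta_n^{(\mathcal{A},\alpha)}(T_n)$, where
$\pi_1, \pi_2 \in \mathbb{R}$ such that $\pi_1, \pi_2 > 0$, $\pi_1 + \pi_2 = 1$.

Theorem 10.

$$\lim_{n\to\infty} \inf_{T_n} \frac{1}{n} \log \mathcal{R}^{(\mathcal{A},\alpha)}(T_n) = \inf_{0<t<1} \log F_t(\varphi, \psi),$$

where $F_t(\varphi, \psi) = QF_t(\varphi^L, \psi^R)(1,1)$.
Proof. By [23, Theorem 2.1],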



liminf-log7^(^'")(T„)= inf log / dm{p^^^^^^f-\p, 

n-5-oo T„ n 0<i<l J ^ 



wneie pip — - , p> — - , 

dm ^ dm 



dm = QF,(y,f^,,),V^f^,„))(l®^l,l®^l) 

= QFi(<^^,^^)(l,l) = Fi((^,^). 

□ 
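The classical statement behind this bound can be checked numerically. The following is a minimal sketch under assumptions not made in the paper: two Bernoulli distributions, equal prior weights $\pi_1 = \pi_2 = 1/2$, and hypothetical function names. It compares the exact exponent of the optimal Bayes error for $n$ independent samples with $\inf_{0<t<1} \log \sum_x p(x)^{1-t} q(x)^{t}$, approximating the infimum on a grid.

```python
import math
import numpy as np

def chernoff_exponent(p, q, grid=10001):
    """inf over 0 < t < 1 of log sum_x p(x)^(1-t) q(x)^t, on a grid of t values."""
    t = np.linspace(0.0, 1.0, grid)[1:-1]
    vals = np.log((p[None, :] ** (1.0 - t[:, None]) * q[None, :] ** t[:, None]).sum(axis=1))
    return float(vals.min())

def bayes_error_rate(p1, q1, n, prior=(0.5, 0.5)):
    """(1/n) log of the minimal Bayes error for n i.i.d. Bernoulli(p1) vs Bernoulli(q1)."""
    k = np.arange(n + 1)
    log_binom = np.array(
        [math.lgamma(n + 1) - math.lgamma(i + 1) - math.lgamma(n - i + 1) for i in k]
    )
    log_a = math.log(prior[0]) + log_binom + k * math.log(p1) + (n - k) * math.log(1 - p1)
    log_b = math.log(prior[1]) + log_binom + k * math.log(q1) + (n - k) * math.log(1 - q1)
    log_min = np.minimum(log_a, log_b)   # the optimal test picks the larger term
    mx = log_min.max()                   # log-sum-exp for numerical stability
    return float((mx + math.log(np.exp(log_min - mx).sum())) / n)

p = np.array([0.3, 0.7])
q = np.array([0.6, 0.4])
print("Chernoff exponent:", chernoff_exponent(p, q))
for n in (50, 500, 2000):
    print(n, bayes_error_rate(0.3, 0.6, n))
```

The error exponent stays below the Chernoff value for every $n$ and approaches it slowly as $n$ grows.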
6.3 Bayesian α-Predictive State

The concept of a Bayesian α-predictive state or a generalized Bayes predictive state studied
in earlier papers [H El SOl HI] is a generalization of a Bayes (escort) predictive state which 
appeared in the previous section. However, we can easily see that it is difficult to apply 
this concept in a C*-algebraic setting. Thus we define a class of statistical models.

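Definition 5. A family of states $\{\omega_\theta \mid \theta \in \Theta\}$ parametrized by a compact set $\Theta$ in $\mathbb{R}^d$ is
called a classical model if it satisfies the following three conditions:

(i) There is a subcentral measure $m$ on $E_{\mathfrak{A}}$ such that $\mu_{\omega_\theta} \ll m$ for every $\theta \in \Theta$.

(ii) The set $\{\rho \in E_{\mathfrak{A}} \mid p_\theta(\rho) := \frac{d\mu_{\omega_\theta}}{dm}(\rho) > 0\}$ is independent of $\theta \in \Theta$.

(iii) $\omega_\theta$ is Bochner integrable.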

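Let us define a Bayesian α-predictive state.

Definition 6. Let $\{\omega_\theta \mid \theta \in \Theta\}$ be a classical model, $\pi(\theta)$ be a probability density on $\Theta$,
$\rho^n = \{\rho_1, \ldots, \rho_n\}$, $-1 \le \alpha \le 1$. A state

$$\omega^{\rho^n}_{\pi,\alpha} = \frac{1}{c^{\rho^n}_{\pi,\alpha}} \int \rho\, p_{\pi,\alpha}(\rho|\rho^n)\, dm(\rho), \qquad (11)$$

is called a Bayesian α-predictive state, where

$$p_{\pi,\alpha}(\rho|\rho^n) = \begin{cases} \left(\displaystyle\int p_\theta(\rho)^{\frac{1-\alpha}{2}}\, \pi(\theta|\rho^n)\, d\theta\right)^{\frac{2}{1-\alpha}}, & (\alpha \neq 1), \\ \exp\left(\displaystyle\int \log p_\theta(\rho)\, \pi(\theta|\rho^n)\, d\theta\right), & (\alpha = 1), \end{cases}$$

a posteriori probability density

$$\pi(\theta|\rho^n) = \frac{\pi(\theta) \prod_{j=1}^{n} p_\theta(\rho_j)}{\int \pi(\theta) \prod_{j=1}^{n} p_\theta(\rho_j)\, d\theta},$$

and

$$c^{\rho^n}_{\pi,\alpha} = \int p_{\pi,\alpha}(\rho|\rho^n)\, dm(\rho). \qquad (12)$$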

We prove the following theorem to justify the use of a Bayesian a-predictive state in the 
context of quantum statistical decision theory, which is first given by pLj and generalized 
by [HI Ellin]. 

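Theorem 11. For a state-valued function $\rho^n \mapsto \varphi^{\rho^n}$ such that each $\varphi^{\rho^n}$ has a barycentric measure
which is absolutely continuous with respect to $m$, its risk function

$$R^{(\alpha)}(\omega_\theta\|\varphi) = \iint S^{(\alpha)}(\omega_\theta\|\varphi^{\rho^n}) \prod_{i=1}^{n} d\mu_{\omega_\theta}(\rho_i)\, \pi(\theta)\, d\theta \qquad (13)$$

is minimized at $\omega^{\rho^n}_{\pi,\alpha}$.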



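Proof. We prove this theorem only when $\alpha \neq \pm 1$, since the cases $\alpha = \pm 1$ can be proved by a similar
argument.

$$S^{(\alpha)}(\omega_\theta\|\varphi^{\rho^n}) = \frac{4}{1-\alpha^2} \int \left(1 - p_\theta(\rho)^{\frac{1-\alpha}{2}}\, q^{\rho^n}(\rho)^{\frac{1+\alpha}{2}}\right) dm(\rho),$$

where $q^{\rho^n} = \frac{d\mu_{\varphi^{\rho^n}}}{dm}$. For state-valued functions $\varphi_1, \varphi_2$ with densities $q_1^{\rho^n}, q_2^{\rho^n}$,

$$R^{(\alpha)}(\omega_\theta\|\varphi_1) - R^{(\alpha)}(\omega_\theta\|\varphi_2)$$
$$= \iint \left( S^{(\alpha)}(\omega_\theta\|\varphi_1^{\rho^n}) - S^{(\alpha)}(\omega_\theta\|\varphi_2^{\rho^n}) \right) \prod_{i=1}^{n} d\mu_{\omega_\theta}(\rho_i)\, \pi(\theta)\, d\theta$$
$$= \frac{4}{1-\alpha^2} \iint \left( \int p_\theta(\rho)^{\frac{1-\alpha}{2}} \prod_{i=1}^{n} p_\theta(\rho_i)\, \pi(\theta)\, d\theta \right) \left( q_2^{\rho^n}(\rho)^{\frac{1+\alpha}{2}} - q_1^{\rho^n}(\rho)^{\frac{1+\alpha}{2}} \right) dm(\rho) \prod_{i=1}^{n} dm(\rho_i)$$
$$= \frac{4}{1-\alpha^2} \iint p_{\pi,\alpha}(\rho|\rho^n)^{\frac{1-\alpha}{2}} \left( q_2^{\rho^n}(\rho)^{\frac{1+\alpha}{2}} - q_1^{\rho^n}(\rho)^{\frac{1+\alpha}{2}} \right) dm(\rho)\, K(\rho^n) \prod_{i=1}^{n} dm(\rho_i),$$

where $K(\rho^n) = \int \prod_{i=1}^{n} p_\theta(\rho_i)\, \pi(\theta)\, d\theta$. When $\varphi_2$ is equal to $\omega^{\rho^n}_{\pi,\alpha}$, we have $q_2^{\rho^n} = p_{\pi,\alpha}(\cdot|\rho^n) / c^{\rho^n}_{\pi,\alpha}$, and hence

$$R^{(\alpha)}(\omega_\theta\|\varphi_1) - R^{(\alpha)}(\omega_\theta\|\omega_{\pi,\alpha})$$
$$= \int \left(c^{\rho^n}_{\pi,\alpha}\right)^{\frac{1-\alpha}{2}} S^{(\alpha)}(\omega^{\rho^n}_{\pi,\alpha}\|\varphi_1^{\rho^n})\, K(\rho^n) \prod_{i=1}^{n} dm(\rho_i) \ge 0.$$

Equality holds when $\varphi_1^{\rho^n}$ equals $\omega^{\rho^n}_{\pi,\alpha}$. Since $\varphi_1^{\rho^n}$ is arbitrary, this means

$$R^{(\alpha)}(\omega_\theta\|\varphi) \ge R^{(\alpha)}(\omega_\theta\|\omega_{\pi,\alpha})$$
for any state-valued function $\varphi^{\rho^n}$. □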
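The minimizing property can be illustrated in the commutative, finite setting. The sketch below is an assumption-laden toy version, not the paper's construction: the parameter set is finite, the posterior is given directly as a weight vector, and the names `alpha_predictive` and `posterior_risk` are hypothetical. It builds the normalized α-predictive distribution and compares its posterior expected α-divergence with that of randomly drawn competitors.

```python
import numpy as np

def alpha_divergence(p, q, alpha):
    """Classical alpha-divergence D^(alpha)(p||q) for strictly positive vectors."""
    if alpha == 1:
        return float(np.sum(q * np.log(q / p)))
    if alpha == -1:
        return float(np.sum(p * np.log(p / q)))
    s = np.sum(p ** ((1.0 - alpha) / 2.0) * q ** ((1.0 + alpha) / 2.0))
    return float(4.0 / (1.0 - alpha ** 2) * (1.0 - s))

def alpha_predictive(models, weights, alpha):
    """Normalized alpha-predictive distribution.

    models:  array (K, M), row k is the distribution p_theta_k
    weights: array (K,), posterior probabilities of the K parameter values
    """
    if alpha == 1:
        unnorm = np.exp(weights @ np.log(models))
    else:
        e = (1.0 - alpha) / 2.0
        unnorm = (weights @ models ** e) ** (1.0 / e)
    return unnorm / unnorm.sum()

def posterior_risk(q, models, weights, alpha):
    """Posterior expectation of D^(alpha)(p_theta || q)."""
    return float(sum(w * alpha_divergence(p, q, alpha) for p, w in zip(models, weights)))

rng = np.random.default_rng(0)
models = rng.dirichlet(np.ones(3), size=4)   # four candidate distributions on 3 points
weights = rng.dirichlet(np.ones(4))          # a posterior over the four candidates
for a in (-1, 0.0, 0.5, 1):
    q_star = alpha_predictive(models, weights, a)
    print(a, q_star, posterior_risk(q_star, models, weights, a))
```

For $\alpha = -1$ the construction reduces to the usual Bayes mixture, and for $\alpha = 1$ to the normalized geometric mean of the candidate distributions.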

7 Conclusion 

We have proved the validity of the use of the Hiai-Ohya-Tsukada theorem on the basis of
sector theory and measuring processes. As a result, quantum versions of Stein's lemma
and of Sanov's theorem are proved, and it turns out that the quantum relative entropy is
a rate function in large deviation theory. In addition, we have formulated the measurement
which allows us to apply information criteria, and defined information criteria for quantum
states. It is shown that their accuracy is the same as that in the classical case. However,
there is plenty of room for further study of measurements with which we can evaluate each model
on an equal footing.

Let us compare the results in this paper with past studies. The methods in this paper
are extensions of classical ones, which suit modern statistics and take a different viewpoint from
[16] [19] and succeeding investigations. In particular, model selection using information
criteria is comparable with hypothesis testing as a method for constructing models and testing
hypotheses, and is expected to apply to quantum systems effectively. In the asymptotic theory
of quantum hypothesis testing, we considered in this paper the universal situation in which the
number of measured data increases, while the special but important situation that
the number of quanta in the system increases were examined in past studies O [TH |15l 
[T8l [2T| |23| [211 [25] . It is common for both of them that optimization of measurement is 
vital. We hope that both universal and special methods in quantum statistical inference 
will be developed in the future. 

A Proofs of Theorems 

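Proof of Theorem 7. It suffices to prove the case $|\alpha| < 1$. It is proved by the discussion in [17,
p. 129] that

$$QF_t(\varphi^L, \psi^R)(1,1) = \left(\Psi, \Delta_{\Phi,\Psi}^{1-t} \Psi\right).$$

Therefore, we have

$$S^{(\alpha)}(\varphi\|\psi)_{\mathrm{Uhlmann}} = \frac{4}{1-\alpha^2}\left(1 - QF_{\frac{1+\alpha}{2}}(\varphi^L, \psi^R)(1,1)\right)$$
$$= \frac{4}{1-\alpha^2}\left(1 - \left(\Psi, \Delta_{\Phi,\Psi}^{1-\frac{1+\alpha}{2}} \Psi\right)\right)$$
$$= \frac{4}{1-\alpha^2}\left(1 - \left(\Psi, \Delta_{\Phi,\Psi}^{\frac{1-\alpha}{2}} \Psi\right)\right) = S^{(\alpha)}(\varphi\|\psi)_{\mathrm{Araki}}.$$

□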

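Proof of Theorem 8. It suffices to prove the case $|\alpha| < 1$. By [17, Lemma 3.1], it is proved that

$$QF_t(\varphi^L, \psi^R)(A, A) = QF_t(\tilde{\varphi}^L, \tilde{\psi}^R)(\pi(A), \pi(A)),$$
where $\pi = \pi_{\varphi+\psi}$. Thus we have

$$S^{(\alpha)}(\varphi\|\psi) = \frac{4}{1-\alpha^2}\left(1 - QF_{\frac{1+\alpha}{2}}(\varphi^L, \psi^R)(1,1)\right)$$
$$= \frac{4}{1-\alpha^2}\left(1 - QF_{\frac{1+\alpha}{2}}(\tilde{\varphi}^L, \tilde{\psi}^R)(\pi(1), \pi(1))\right) = S^{(\alpha)}(\tilde{\varphi}\|\tilde{\psi}).$$

□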



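Lemma A.1. Let $\varphi|_{\mathcal{B}}, \psi|_{\mathcal{B}}$ be the restrictions of states $\varphi, \psi$ on a von Neumann algebra
$\mathcal{M}$ to a *-subalgebra $\mathcal{B}$ of $\mathcal{M}$. Then,

$$S^{(\alpha)}_{\mathcal{B}}(\varphi\|\psi) := S^{(\alpha)}(\varphi|_{\mathcal{B}}\|\psi|_{\mathcal{B}}) \le S^{(\alpha)}(\varphi\|\psi).$$

Proof. It suffices to prove it for $|\alpha| < 1$. $\iota^* : E_{\mathcal{M}} \to E_{\mathcal{B}}$ is the dual of the embedding
$\iota : \mathcal{B} \ni B \mapsto B \in \mathcal{M}$. Using [12, Proposition 9] and Theorem 8,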

QF,(vp^||^^)(l,l) = Q/,(v.^||V^^)(l)2 

= gF,(Me)^ii(^i^)^)(i,i). 
Thus we have

1 — 2 
1 — 2 

□ 

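Proof of Theorem 9. It suffices to prove the case $|\alpha| < 1$.

We define $\Gamma : \mathfrak{A} \to C(E_{\mathfrak{A}})$ by $\Gamma(A)(\omega) = \omega(A)$, and its dual map $\Gamma^* : M_1(E_{\mathfrak{A}}) \to E_{\mathfrak{A}}$
by

$$(\Gamma^*\lambda)(A) = \int \omega(A)\, d\lambda(\omega) = \lambda(\Gamma(A)),$$
for $\lambda \in M_1(E_{\mathfrak{A}})$. $\Gamma$ satisfies

$$\Gamma(1) = 1, \quad \Gamma(A^*) = \Gamma(A)^*, \quad \Gamma(A^*)\Gamma(A) \le \Gamma(A^*A),$$

for any $A \in \mathfrak{A}$. Thus,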

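By assumption, $m$ is a subcentral measure $\mu_{\mathcal{C}}$ associated with a von Neumann subalgebra
$\mathcal{C}$ of the center $\mathfrak{Z}_\chi(\mathfrak{A}) = \pi_\chi(\mathfrak{A})'' \cap \pi_\chi(\mathfrak{A})'$ of a state $\chi \in E_{\mathfrak{A}}$. We define a *-isomorphism
$\kappa_m : L^{\infty}(E_{\mathfrak{A}}, m) \to \mathcal{C}$ by

$$(\Omega_\chi, \pi_\chi(A)\kappa_m(f)\Omega_\chi) = \int f(\rho)\rho(A)\, dm(\rho).$$

It is then proved in [17, Example 2.6 and Theorem 3.2] that

$$\tilde{\varphi}(\kappa_m(f)) = \int f(\rho)\, d\mu(\rho),$$

$$\tilde{\psi}(\kappa_m(f)) = \int f(\rho)\, d\nu(\rho),$$
where $\tilde{\varphi}, \tilde{\psi}$ are the normal extensions of $\varphi, \psi$ to $\pi_\chi(\mathfrak{A})''$. By Lemma A.1,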

□ 